Statistical Characterization of Transcription Start Sites in Plant Genomes
Abstract
Although large amounts of genomic and full-length cDNA sequence data from plants are now publicly available, knowledge of promoters and transcription start sites (TSSs) in plants is still limited compared to mammals such as human and mouse. A recent paper reported a prominent GC-compositional strand bias, or GC-skew (=(C-G)/(C+G), where C and G denote the numbers of cytosine and guanine residues), near the transcription start sites in Arabidopsis thaliana [6]. However, it is unclear whether other eukaryotic species have equally prominent GC-skews, and the biological meaning of this trait remains unknown. In this study, we conducted a comparative analysis using sequences from various eukaryotic genomes (animals, fungi, protists, and plants) to statistically characterize the TSSs of plant genes. In addition, we explored the potential value of GC-skew as an index for TSS prediction in plant genomes, where CpG islands and genes are not correlated.
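As an illustration of the GC-skew definition given in the abstract, the following is a minimal sketch in Python. It is not the authors' method: the function names `gc_skew` and `windowed_gc_skew`, the sliding-window scheme, and the choice to return 0.0 for a window with no C or G are all assumptions made for this example.

```python
from typing import List, Tuple


def gc_skew(seq: str) -> float:
    """Return (C - G) / (C + G), the sign convention used in the abstract.

    Assumption: a sequence with no C or G yields 0.0 rather than an error.
    """
    s = seq.upper()
    c = s.count("C")
    g = s.count("G")
    if c + g == 0:
        return 0.0
    return (c - g) / (c + g)


def windowed_gc_skew(seq: str, window: int, step: int = 1) -> List[Tuple[int, float]]:
    """Compute GC-skew in sliding windows (hypothetical helper).

    Returns (start_index, skew) pairs for every full window of length `window`.
    """
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    return [
        (i, gc_skew(seq[i:i + window]))
        for i in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    # Toy sequence, not real genomic data.
    for start, skew in windowed_gc_skew("CCCGATGGGC", window=4, step=2):
        print(start, round(skew, 3))
```

Applied to sequence aligned on annotated TSSs, a windowed profile like this would show where the skew peaks relative to the start site; window size and step would need tuning for real data.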
Similar resources
Analysis of GC-compositional Strand Bias in the Transcription Start Sites of Plant and Fungal Genes
A recent paper reported a GC-compositional strand bias, or GC-skew (=(C-G)/(C+G), where C and G denote the numbers of cytosine and guanine residues, respectively), near the transcription start sites (TSSs) in Arabidopsis [4]. However, it is unclear whether other eukaryotic species have similar GC-skews, and the biological meaning of this bias remains unknown. In this study, we conducted compa...
TSSer: an automated method to identify transcription start sites in prokaryotic genomes from differential RNA sequencing data
MOTIVATION: Accurate identification of transcription start sites (TSSs) is an essential step in the analysis of transcription regulatory networks. In higher eukaryotes, the capped analysis of gene expression technology enabled comprehensive annotation of TSSs in genomes such as those of mice and humans. In bacteria, an equivalent approach, termed differential RNA sequencing (dRNA-seq), has recen...
Mycobacterium avium subsp. paratuberculosis induces differential cytosine methylation at miR-21 transcription start site region
Mycobacterium avium subspecies paratuberculosis (MAP), an obligate intracellular bacterium, causes paratuberculosis (Johne’s disease) in ruminants. In addition, MAP has consistently been isolated from Crohn’s disease (CD) lesions in humans; a notion implying possible direct causative ...
EuGene-PP: a next-generation automated annotation pipeline for prokaryotic genomes
It is now easy and increasingly usual to produce oriented RNA-Seq data as a prokaryotic genome is being sequenced. However, this information is usually just used for expression quantification. EuGene-PP is a fully automated pipeline for structural annotation of prokaryotic genomes integrating protein similarities, statistical information and any oriented expression information (RNA-S...
Profiling of Accessible Chromatin Regions across Multiple Plant Species and Cell Types Reveals Common Gene Regulatory Principles and New Control Modules.
The transcriptional regulatory structure of plant genomes remains poorly defined relative to animals. It is unclear how many cis-regulatory elements exist, where these elements lie relative to promoters, and how these features are conserved across plant species. We employed the assay for transposase-accessible chromatin (ATAC-seq) in four plant species (Arabidopsis thaliana, Medicago truncatula...
Publication date: 2005